Corpus: urd-in_web_2015_10K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 47 91 97 97 97
1000 68 121 129 130 130
10000 1110 4466 7652 8782 8985
100000 1110 4466 7653 8783 8986
1000000 1110 4466 7653 8783 8986


Zipf's diagram for sentence endings


Gnuplot diagram

2261 msec needed at 2018-06-29 23:09